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Materials and Methods relating to a Plant Regulatory 
Protein 

Field of the invention 
5 This invention relates to materials and methods 

relating to a plant regulatory protein. More 
particularly, the invention relates to the cloning and 
expression of the TTGl gene of Arabidopsis thaliana, and 
homologues from other species, and manipulation and use 
10 of the gene in plants. 

Background of the invention 

The protein encoded by the TTGl (transparent testa, 
glabra) locus plays a central role in many pathways in 

15 Arabidopsis thaliana. Many of these pathways are 

confined to effects on the epidermal cell layer of 
different tissues. Mutations at the TTGl locus have a 
large range of pleiotropic effects (Koornneef (1981) 
Arabid Inform. Serv . 18 45-51). It is known that fctgl 

20 mutants have a glabrous phenotype with no leaf or stem 

hairs (trichomes) which are normally derived from the LI 
layer of cells, the outer single layer of cells covering 
the meristem that differentiates into all epidermal cells 
of the leaf. No purple anthocyanin pigments are present 

25 in the seed coat leading to the yellow cotyledons being 

visible through the transparent testa. In the wild- type 
plant, anthocyanins are present in the hypocotyl of 
seedlings and in the stem and leaves of plants as they 
age, and are inducible by many forms of stress including 

30 by high light, poor nutrients or water stress. Mutants 

of the ttgri locus completely lack anthocyanins in the 
epidermis of leaves and stems (Koornneef (1981) Arabid 
Inform. Serv. 18 45-51) . Tufts of mucilage are absent 
from Ctgl mutant seeds and the seeds show no secretion of 

35 mucilage on imbibing, unlike wild-type plants (Koornneef 

(1981) Arabid Inform. Serv. 18 45-51). In wild-type 
plants, root hairs extend from root epidermal cells only 
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in files of cells that contact two underlying cortical 
cells, whereas in ttgl mutants extra root hairs occur in 
the atrichoblast cell files (Galway et al (1994) Dev. 
Biol. 166 740-754). Seeds of ttg-l mutant plants do not 
5 require drying and cold treatments to germinate and 

exhibit an altered seed dormancy compared to ecotypes 
such as Landsberg erecta (Koornneef et al (1982) Theoret 
Appl Genet 61 385-393) . 

Several genetic loci involved in trichome, or leaf 

10 hair, differentiation and development have been described 

from Arabidopsis (Koornneef (1981) Arabid Inform. Serv. 
18 45-51, Hulskamp et al . {1994)^Cell 76 555-566). Three 
loci that play a role in the initiation of trichomes have 
been identified; these are GLl (glabra 1) , TTGl 

15 (transparent testa glabra) and TRY (triptychon) . 

Mutations at the GLl locus lead to hairless plants 
whereas the TRY locus affects the spacing of trichomes, 
which form clumps in try mutant plants (Hulskamp et al. 
(1994) Cell 76 555-566) . GLl is a MYB transcription 

20 factor (Marks and Feldmann (198 9) Plant Cell 1 1043-1050, 

Oppenheimer et al (1991) Cell 67 483-493) . Mutations at 
another glabra locus GL2 have some features in common 
with ttgl, although gl2 mutants have noirmal anthocyanin 
content and have rudimentary trichomes suggesting the 

25 fate of these cells has already been determined 

(Koornneef (1981) Arabid Inform. Serv. 18 45-51) . They 
have an increased number of ectopic root hairs, although 
the atrichoblast cells resemble wild-type cells more than 
in the ttgl mutant indicating a role later in the 

30 development of the root epidermal cells (Masucci et al 

(1996) Development 122 1253-1260) . The effects on the 
seed coat and mucilage are similar to that of the ctgl 
mutants (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . 
The GL2 locus encodes another transcription factor, a 

3 5 homeodomain protein with a leucine zipper domain. By in 

situ hybridization, the GL2 gene is expressed in 
developing trichomes (Rerie et al (1994) Genes Dev. 8 
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1388-1389) and in the atrichoblast cell files of the 
roots (Masucci et al (1996) Development 122 1253-1260) . 
Transcript levels of GL2 are greatly reduced in ttgl 
mutants, suggesting that the TTGl gene product is 
required for normal expression of GL2 (Cristina et al 
(1996) Plant J 10 393-402) . 

In Ctgl mutants the anthocyanin biosynthetic pathway 
is blocked at the dihydroflavonol-4 -reductase (DFR) step 
because no DFR message is detected in these mutants 
(Shirley et al (1995) Plant J 8 659-671) whereas 
transcripts encoding chalcone synthase and chalcone 
isomerase are unaffected. This resembles the effect of 
Delila mutants in Antirrhinum (Martin et al (1991) Plant 
J 1 37-49) . In maize the equivalent locus called the R 
gene affects the whole pathway from chalcone synthase 
(CS) onwards. Delila and R are both MYC-like 
transcription factors (Ludwig et al (1989) Proc Natl Acad 
Sci USA 86 7092-7096, Goodrich et al (1992) Cell 68 955- 
964) . R has been shown to activate directly the 
transcription of several genes encoding anthocyanin 
biosynthetic enzymes in conjunction with a MYB 
transcription factor encoded by the CI gene in maize 
(Goff et al (1992) Genes Dev. 6 864-875) . Complementation 
of a ttgl mutant by cauliflower mosaic virus 35S 
promoter-i? constructs (Lloyd et al (1992) Science 258 
1773-1775) was used to suggest that TTGl might encode an 
Arabidopsis R homologue . A further characterised 
transcription factor (Caprice [CPC] - see Wada et al 
1997, Science 277, 1113-1116) may act in the opposite way 
to TTGl in promoting root hair development, and possible 
reducing the trichome number. 

The TTGl locus has been broadly mapped by Koornneef 
(Koornneef et al (1982) Theoret Appl Genet 61 385-393) to 
chromosome 5 between msl and ga3 . At position 31.5 the 
ttgl locus has been used as a phenotypic marker in many 
crosses . 
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Summary of the invention 

The present inventors have identified the TTGl locus 
as a gene encoding a WD4 0 repeat protein by 
complementation of a ttgri mutant with genomic DNA and by 
5 sequencing the gene in several ttgl mutant alleles. The 

TTGl gene has now been cloned and sequenced and the 
inventors have demonstrated that it encodes a WD40 repeat 
protein with 7 repeat units. The 1.6 kb transcript is 
present in all major organs. The identification of the 

10 product of the TTGl locus as a WD4 0 repeat protein rules 

out the possibility that the protein acts as a 
transcription factor, unlike the products of the other 
genes, GLl and GL2, affecting trichome development. 
Additionally the TTGl protein bears no resemblance to the 

15 maize R gene. The present inventors propose that the WD40 

repeat protein is a component of a signal transduction 
pathway which regulates expression or action of 
downstream transcription factors, and in particular that 
TTGl acts upstream of an Arabidopsis R homologue in the 

2 0 pathways leading to trichome differentiation and 

anthocyanin synthesis. 

The TTGl gene has a novel sequence. No Arabidopsis 
genes showing significant homology to TTGl were 
identified in public databases. A protein of unknown 
25 function showing 61% amino acid identity is encoded on 

chromosome 3 of Arabidopsis (orflO in database accession 
number X9813 0) , but transcripts of this gene do not cross 
hybridise with TTGl at high stringency on Northern blots. 
However, a region of the TTGl protein showed homology to 

3 0 an Expressed Sequence Tag (EST) of unknown function. The 

EST came from a cell suspension culture from Eco type 
Columbia (clone library AC16H) . 

Additionally the TTGl gene shows 87.5% similarity to 
the anil gene from Petunia. This gene is discussed by 
35 Vetten et al (1997) Genes £c Development 11: 1422-1434, 

Pub. Cold Spring Harbor Laboratory Press, which may have 
been published before the claimed priority date of the 
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present invention. Interestingly, the anil locus is 
described as controlling anthocyanin pigmentation and 
hence flower colour - but apparently does not exert the 
pleiotropic effects (e.g. trichomes, anthocyanin in other 
5 parts of the plant) of the TTGl gene which forms the 

basis of the present invention. 

A genomic sequence encompassing Arabidopsis TTGl has 
recently (after the priority date of the present 
invention) been put on a database under accession number 

10 AB010068. 

Thus according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
including a nucleotide sequence encoding a polypeptide 
with TTGl function. Those skilled in the art will 

15 appreciate that "TTGl function" may be used to refer to 

the ability to manipulate the phenotypic characteristics 
of plants as described below when its expression is 
altered like the TTGl gene of Arabidopsis thaliana. 

Manipulation of the phenotypic characteristics of 

20 plants may be achieved by altering the expression of the 

TTGl gene (by increasing/decreasing expression or by- 
mutation) or by interfering with the normal function of 
the TTGl protein. Further, manipulation may be achieved 
by providing for the expression of a further homologous 

25 transcript which is able to interact with the expression 

of the TTGl gene in such a way as to either prevent 
translation of the transcript occurring or to boost the 
levels of transcripts being translated. 

Examples of phenotypic characteristics that may be 

30 manipulated in accordance with the present invention are 

given below. Preferably at least 2, 3, 4, 5 or 6 or more 
of these characteristics are manipulated: 
1. Trichomes (hairs) on aerial parts of plants: 
trichomes have a number of functions and the present 

35 invention provides a way to increase and decrease the 

number of trichomes on different organs to enhance their 
effectiveness. The increase or decrease in the number of 
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trichomas may be utilized in: 

(i) insect protection: due to mechanical effects 
and to chemicals from glandular hairs . These could 
be increased, to increase insect protection, on 
leaves or on cotyledons which often do not have 
hairs. Protection of cotyledons from insect attack 
may allow faster seedling growth. 

(ii) chemical production: glandular trichomas are 
involved in producing pheromones, antifeedants and 
other chemicals, including essential oils, which may 
be increased if the number of trichomas is 
increased . 

(iii) protection in hot, dry climates: hairs form 
boundary layers for decreased water loss. Hairier 
plants may have an advantage in warmer climates. 
Hairs may also provide shade and protection for 
meristem in young seedlings, allowing faster 
seedling growth. 

(iv) salt removal from leaves: the presence of salt 
glands would allow trichomas to sequester or secrete 
salt. (Relatives of rice have microtrichomes of 2 
cells which secrete salt.) 

(v) cotton fibres: it may be advantageous to 
increase the number of cotton fibres per boll, and 
at the same time decrease leaf trichomes to prevent 
insects hiding and prevent contamination of bolls. 

(vi) ornamental plants: it may be preferred to 
decrease the number of hairs on hairy and glabrous 
varieties of a range of garden plants. 

2. Trichomas on roots: manipulation of the number of 
root hairs may affect water and nutrient absorption (crop 
nutrient use efficiency) by the plants. Root hairs are 
also involved in anchoring the plant in the soil, 
particularly sandy soils, and allow better root 
penetration . 

3. Seed mucilage: manipulation may lead to better seed 
germination in dry soils, due to maintenance of moisture 
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around the seed. 

4 . Seed dormancy; alteration of seed dormancy may allow 
quicker, or slower (if viviparous) , germination of seeds 
after harvest. This may lead to faster cycling of crops. 
5 5. Anthocyanin pigments: have a range of functions in 

the plant, and manipulation may alter pigmentation of 
seeds, leaves, flowers and fruit. Such manipulation may 
lead to : 

(i) UV-B protection of plants, mainly in leaves, but 
10 anthocyanins are produced in plants under a wide 

range of stresses, including water stress, light 
stress, increased sugars. These stresses lead to 
decreased photosynthesis and susceptibility to 
photoxidat ion . 

15 (ii) Altered flower and leaf colour in ornamentals 

and food crops, eg broccoli; altered fruit and seed 
colour in food, eg aubergines and grains {maize, 
rice, etc) . 

6. Condensed tannins, produced by the polymerisation of 
20 anthocyanin precursors, are found in many plants and are 

responsible in part for the taste characteristics of a 
range of fruits and vegetables, such as apple, kiwifruit, 
gooseberry, redcurrant and banana. Condensed tannins 
produce characteristic astringent properties in tea, 

25 coffee, wine, spices and fruit juices. Tannins also have 

important effects in animal feedstuffs. In monogastric 
animals, such as pigs and chickens, tannins limit the use 
of potential feedstuffs such as faba beans and sorghum. 
In ruminants, moderate levels of tannins are beneficial 

30 and may improve retention of dietary nitrogen, but higher 

levels reduce the nutritive value of foliage and 
feedstuffs. Manipulation of TTGl may alter the levels of 
condensed tannins in these plants. 

7. Stomata on hypocotyls: increases in the number 

35 stomata may result in faster seedling growth under ideal 

conditions, such as optimum water and COj availability. 
The present invention provides a nucleic acid 
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isolate encoding a polypeptide including the amino acid 
sequence shown in Figure 3 (SEQ ID No. 2) or homologues 
thereof, which may include the coding sequence shown in 
Figure 3 which is that of the TTGl gene of Arabidopsis 
thaliana, and/or other transcribed parts of the gene e.g. 
as shown in Figure 3 or Figure 5 . 

Nucleic acid according to the present invention may 
have the sequence of an TTGl gene of Arabidopsis 
thaliana, or be a mutant, variant, derivative or allele 
or a homologue of the sequence provided. Preferred 
mutants, variants, derivatives and alleles are those 
which encode a protein which retains a functional 
characteristic of the protein encoded by the wild-type 
gene, especially the ability to affect a physical 
characteristic of a plant, such as the phenotypic 
characteristics outlined above. 

A mutant, variant, derivative or allele in 
accordance with the present invention may have the 
ability to affect a physical characteristic of a plant, 
particularly a phenotypic characteristic identified 
above. Thus, a mutant, variant, derivative or allele may 
decrease the amount of anthocyanins in the epidermis of 
leaves and stems compared with wild- type on expression in 
a plant, e.g. compared with the effect obtained using a 
gene sequence expressing the polynucleotide sequence of 
Figure 3 . 

Alternatively or in addition, a mutant, variant, 
derivative or allele increases or decreases the number of 
trichomes on different organs compared with wild-type on 
expression in a plant, e.g. compared with the effect 
obtained using a gene sequence expressing the 
polynucleotide sequence of Figure 3. Down -regulation of 
TTGl activity may be achieved by mutant nucleic acids 
(e.g through co-suppression) or by mutant polypeptides, 
which may compete for receptors or other binding sites 
for TTGl, without triggering appropriate effects. 

Comparison of effect on the increase or decrease of 
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trichomas or other characteristics may be performed in 
Arabidopsis thaliana , although nucleic acid according to 
the present invention may be used in the production of a 
wide variety of plants and for influencing a phenotypic 
5 characteristic thereof. 

Changes to a sequence, to produce a mutant, variant 
or derivative, may be by one or more of addition, 
insertion, deletion or substitution of one or more 
nucleotides in the nucleic acid, leading to the addition, 

10 insertion, deletion or substitution of one or more amino 

acids in the encoded polypeptide. Further, it may lead to 
the creation of stop codons resulting truncated 
polypeptide; removal of stop codons resulting in extended 
polypeptides; or a frameshift resulting in a polypeptide 

15 lacking TTGl function. Of course, changes to the nucleic 

acid which make no difference to the encoded amino acid 
sequence are included. 

A preferred nucleic acid sequence for an TTGl gene 
is shown as the coding sequence within Figure 3/SEQ ID 

20 No. 1, alongside the predicted amino acid sequence of a 

polypeptide according to the present invention which has 
TTGl function (SEQ ID No. 2) . 

Particular mutant alleles of the nucleic acid 
according to the present invention include: 

25 a) ttgl.lO (SEQ ID No. 3) which contains a point 

mutation (G to A) in the 5' untranslated part of the TTGl 
sequence (see Fig 3); 

b) ttgl.l9 (SEQ ID No. 4) which results in the 
introduction of a stop codon at codon 183; 

30 c) ttgl.l (SEQ ID No. 5 - formerly designated 

ttgl.21) which results in the introduction of a stop 
codon at codon 317; 

d) ttgl.20 (SEQ ID No. 6) which contains a point 
mutation (S to C) at codon 30, plus introduction of a 

35 stop codon at codon 310; 

e) ttgl.9 (SEQ ID No. 7) which contains a point 
mutation (S to F) at codon 282. 
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f) Ctgl.15, ttgl.16, ttgl.l7, ttgl.18 which all 
result in a stop codon at codon 310 (via a substitution 
of 2 different bases - TCGGCT to TAGACT - this sequence 
is designated SEQ ID No . 13) . 

Mutant alleles b) to f) contain point mutations that 
result in changes to the protein product. Mutations in 
the 5 ' untranslated leader sequences may affect the 
translation of the RNA. 

Interestingly the present inventors have established 
that these mutations can lead to quite different 
phenotypes for the plants expressing them. For instance 
ttgl.9 has more anthocyanin present than ttgl . l 
{reference allele) while ttgl. 10 has more trichomes, 
different seed mucilage and less anthocyanin than ttgl.9 
(see Larkin et al, Plant Cell 6, 1065-1076 for an 
analysis) . 

Thus it is clear that, given the sequence 
information disclosed herein, it will be possible for the 
skilled person, if desried, to generate ttgl mutant 
having some (but not all) of the pleiotropic effects of 
the wild-type/ecotype TTGl gene. 

It will be appreciated by the skilled person that 
the above exemplified point mutations may be present 
individually or in combination with other point 
mutations. In other words, a mutant, allele, variant or 

nderl'vativ-e -amino ~airlii-se'C[uence --in-accDr 

present invention may include within the sequence shown 
in Figure 3, a single amino acid change with respect to 
the sequence shown in Figure 3, or 2, 3, 4, 5, 6, 7, 8, 
or 9 changes, about 10, 15, 20, 30, 40 or 50 changes, or 
greater than about 50, 60, 70, 80 or 90 changes. In 
addition to one or more changes within the amino acid 
sequence shown in Figure 3, a mutant, allele, variant or 
derivative amino acid sequence may include additional 
amino acids at the C-terminus and/or N-terminus. 

A sequence related to a sequence specifically 
disclosed herein shares homology with that sequence. 
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Homology may be at the nucleotide sequence and/or amino 
acid sequence level. Preferably, the nucleic acid and/or 
amino acid sequence shares homology with the nucleotide 
sequence of Figure 3 , or the amino acid sequence encoded 
thereby. Preferably the homology is at least about 50%, 
or 60%, or 70%, or 80% homology, most preferably at least 
about 90%, 95%, 96%, 97%, 98% or 99% homology. 

As is well -understood, homology at the amino acid 
level is generally in terms of amino acid similarity or 
identity. Similarity allows for "conservative 
variation", i.e. substitution of one hydrophobic residue 
such as isoleucine, valine, leucine or methionine for 
another, or the substitution of one polar residue for 
another, such as arginine for lysine, glutamic for 
aspartic acid, or glutamine for asparagine . Similarity 
may be as defined and determined by the TBLASTN program, 
of Altschul et al. (1990) J". Mol . Biol. 215: 403-10, 
which is in standard use in the art, or, and this may be 
preferred, the standard program BestFit, which is part of 
the Wisconsin Package, Version 8, September 19 94, 
(Genetics Computer Group, 575 Science Drive, Madison, 
Wisconsin, USA, Wisconsin 53711) . BestFit makes an 
optimal alignment of the best segment of similarity 
between two sequences. Optimal alignments are found by 
inserting gaps to maximize the number of matches using 
...the- -local- homology-a-lgoritiira— of-Smi4:h— and— Watesnaa.-- 

Homology may be over the full-length of the relevant 
sequence shown herein, or may more preferably be over a 
contiguous sequence of about or greater than about 20, 
25, 30, 33, 40, 50, 67, 133, 167, 200, 233, 267, 300, 333 
or more amino acids or codons, compared with the relevant 
amino acid sequence or nucleotide sequence as the case 
may be . 

Also provided by an aspect of the present invention 
is nucleic acid including or consisting essentially of a 
sequence of nucleotides complementary to a nucleotide 
sequence with any sequence provided herein. Further, 
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there is provided nucleic acid including or consisting 
essentially of a sequence of nucleotides complementary to 
a nucleotide sequence hybridisable with any encoding 
sequence provided herein. Another way of looking at this 
would be for nucleic acid according to this aspect to be 
hybridisable with a nucleotide sequence complementary to 
any encoding sequence provided herein. Of course, DNA is 
generally double-stranded and blotting techniques such as 
Southern hybridisation are often performed following 
separation of the strands without a distinction being 
drawn between which of the strands is hybridising. 
Preferably the hybridisable nucleic acid or its 
complement encode a product able to influence a physical 
characteristic of a plant, particularly a phenotypic 
characteristic as described above. Preferred conditions 
for hybridisation are familiar to those skilled in the 
art, but are generally stringent enough for there to be 
positive hybridisation between the sequences of interest 
to the exclusion of other sequences . 

For instance, screening may initially be carried out 
under conditions, which comprise a temperature of about 
37°C or less, a formamide concentration of less than 
about 50%, and a moderate to low salt (e.g. Standard 
Saline Citrate (^SSC) = 0.15 M sodium chloride; 0.15 M 
sodium citrate; pH 7) concentration. 

"Alt: e m-a-txvel-y 7— a— t empe r art ure— of — atrout.— 5-a^e-or-i«ss 

and a high salt (e.g. 'SSPE'= 0.180 mM sodium chloride; 
9 mM disodium hydrogen phosphate; 9 mM sodium dihydrogen 
phosphate; 1 mM sodium EDTA; pH 7.4). Preferably the 
screening is carried out at about 37°C, a formamide 
concentration of about 20%, and a salt concentration of 
about 5 X SSC, or a temperature of about 50°C and a salt 
concentration of about 2 X SSPE. These conditions will 
allow the identification of sequences which have a 
substantial degree of homology (similarity, identity) 
with the probe sequence, without requiring the perfect 
homology for the identification of a stable hybrid. 
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Suitable conditions include, e.g. for detection of 
sequences that are about 80-90% identical, hybridization 
overnight at 42 °C in 0.2 5M NasHPO, , pH 7 . 2 , 6.5% SDS, 10% 
dextran sulfate and a final wash at 55°C in 0 . IX SSC, 
5 0.1% SDS. For detection of sequences that are greater 

than about 90% identical, suitable conditions include 
hybridization overnight at 6 5°C in 0.2 5M NajHPO^, pH 7.2, 
6.5% SDS, 10% dextran sulfate and a final wash at 60°C in 
O.IX SSC, 0.1% SDS. 

10 The nucleic acid, which may contain, for example, 

DNA encoding the amino acid sequence of Figure 3, as 
genomic or cDNA, may be in the form of a recombinant and 
preferably replicable vector, for example a plasmid, 
cosmid, phage or Agrobacterium binary vector. The 

15 nucleic acid may be under the control of an appropriate 

promoter or other regulatory elements for expression in a 
host cell such as a microbial, e.g. bacterial, or plant 
cell. In the case of genomic DNA, this may contain its 
own promoter or other regulatory elements and in the case 

20 of cDNA this may be under the control of an appropriate 

promoter or other regulatory elements for expression in 
the host cell. 

A vector including nucleic acid according to the 
present invention need not include a promoter or other 

25 regulatory sequence, particularly if the vector is to be 

used to introduce the nucleic acid into cells for 
recombination into the genome . 

Those skilled in the art are well able to construct 
vectors and design protocols for recombinant gene 

30 expression. Suitable vectors can be chosen or 

constructed, containing appropriate regulatory sequences, 
including promoter sequences, terminator fragments, 
polyadenylation sequences, enhancer sequences, marker 
genes and other sequences as appropriate. For further 

35 details see, for example, Molecular Cloning: a Laboratory 

Manual: 2nd edition, Sambrook et ai , 1989, Cold Spring 
Harbor Laboratory Press. Many known techniques and 
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protocols for manipulation of nucleic acid, for example 
in preparation of nucleic acid constructs, mutagenesis, 
sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 
5 detail in Current Protocols in Molecular Biology, Second 

Edition, Ausubel et al. eds . , John Wiley & Sons, 1992. 
The disclosures of Sambrook et al . and Ausubel et al . are 
incorporated herein by reference. Specific procedures 
and vectors previously used with wide success upon plants 

10 are described by Bevan (Nucl . Acids Res. 12, 8711-8721 

(1984)) and Guerineau and Mullineaux (1993) (Plant 
transformation and expression vectors. In: Plant 
Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS 
Scientific Publishers, pp 121-148) . 

15 Selectable genetic markers may be used consisting of 

chimeric genes that confer selectable phenotypes such as 
resistance to antibiotics such as kanamycin, hygromycin, 
phosphinotricin, chlorsulfuron, methotrexate, gentamycin, 
spectinomycin, imidazolinones and glyphosate. 

20 Nucleic acid molecules and vectors according to the 

present invention may be provided isolated and/or 
purified from their natural environment, in substantially 
pure or homogeneous form, or free or substantially free 
of nucleic acid or genes of the species of interest or 

25 origin other than the sequence encoding a polypeptide 

with the required function. Nucleic acid according to 
the present invention may include cDNA, RNA, genomic DNA 
and may be wholly or partially synthetic. The term 
■"isolate" encompasses all these possibilities. Where a 

30 DNA sequence is specified, e.g. with reference to a 

figure, unless context requires otherwise, the RNA 
equivalent, with U substituted for T where it occurs, is 
encompassed . 

When introducing a chosen gene construct into a 
35 cell, certain considerations must be taken into account, 

well known to those skilled in the art . The nucleic acid 
to be inserted should be assembled within a cons-truct 
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which contains effective regulatory elements which will 
drive transcription. There must be available a method of 
transporting the construct into the cell. Once the 
construct is within the cell membrane, integration into 
the endogenous chromosomal material either will or will 
not occur. Finally, as far as plants are concerned the 
target cell type must be such that cells can be 
regenerated into whole plants . 

Plants transformed with the DNA segment containing 
the sequence may be produced by standard techniques which 
are already known for the genetic manipulation of plants. 
Plants can be transformed with DNA using any suitable 
technology, such as a disarmed Ti-plasmid vector carried 
by Agrobacterium exploiting its natural gene transfer 
ability (EP-A-270355 , EP-A-0116718 , NAR 12(22) 8711 - 
87215 1984), particle or microprojectile bombardment {US 
5100792, EP-A-444882, EP-A-434616) microinjection (WO 
92/09696, WO 94/00583, EP 331083, BP 175966, Green et al . 
(1987) Plant Tissue and Cell Culture, Academic Press) , 
electroporation (EP 290395, WO 8706614 Gelvin Debeyser - 
see attached) other forms of direct DNA uptake (DE 
4005152, WO 9012096, US 4684611), liposome mediated DNA 
uptake (e.g. Freeman et al. Plant Cell Physiol. 29: 1353 
(1984)), or the vortexing method (e.g. Kindle, PNAS 
U.S.A. 87: 1228 {1990d) Physical methods for the 
transformation of plant cells are reviewed in Oard, 1991, 
Biotech. Adv. 9: 1-11. 

Agrobacterium transformation is widely used by those 
skilled in the art to transform dicotyledonous species. 
Recently, there has been substantial progress towards the 
routine production of stable, fertile transgenic plants 
in almost all economically relevant monocot plants 
(Toriyama, et al . (1988) Bio/Technology 6. 1072-1074; 
Zhang, et al . (1988) Plant Cell Rep. 1, 379-384; Zhang, 
et aJ . (1988) Theor Appl Genet 76, 835-840; Shimamoto, et 
al. (1989) Nature 338, 274-276; Datta, et al . (1990) 
Bio /Technology 8, 736-740; Christou, et aJ . (1991) 
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Bio/Technology 9, 957-962; Peng, ec ai . (1991) 
International Rice Research Institute, Manila, 
Philippines 563-574; Cao, et a J . (1992) Plant Cell Rep. 
11, 585-591; Li, et al . (1993) Plant Cell Rep. 12, 250- 
255; Rathore, et al . (1993) Plant Molecular Biology 21, 
871-884; Fromm, etal. (1990) B io /Techno logy 8 , 833-839; 
Gordon-Kamm, et al. (1990) Plant Cell 2, 603-618; 
D'Halluin, et al . (1992) Plant Cell 4, 1495-1505; 
Walters, et ai . (1992) Plant Molecular Biology 18, 189- 
200; Koziel, et al . (1993) Biotechnology li, 194-200; 
Vasil, I. K. (1994) Plant Molecular Biology 25, 925-937; 
Weeks, et al . (1993) Plant Physiology 102, 1077-1084; 
Somers, et al . (1992) Bio/Technology 10, 1589-1594; 
W092/14828) . . In particular, Agrrobacterimn mediated 
transformation is now emerging also as an highly 
efficient alternative transformation method in monocots 
(Hiei et ai . (1994) The Plant Journal 6, 271-282). 

The generation of fertile transgenic plants has been 
achieved in the cereals rice, maize, wheat, oat, and 
barley (reviewed in Shimamoto, K. (1994) Current Opinion 
in Biotechnology 5, 158-162.; Vasil, et ai . (1992) 
Bio/Technology 10. 667-674; Vain et al . , 1995, 
Biotechnology Advances 13 (4) : 653-671; Vasil, 1996, 
Nature Biotechnology 14 page 702) . 

Microprojectile bombardment, electroporation and 
direct DNA uptake are preferred where Agrobacterium is 
inefficient or ineffective. Alternatively, a combination 
of different techniques may be employed to enhance the 
efficiency of the transformation process, eg bombardment 
with Agrobacterium coated microparticles (EP-A-4a6234 ) or 
microprojectile bombardment to induce wounding followed 
by co-cultivation with Agrobacterium (EP-A-486233) . 

Following transformation, a plant may be 
regenerated, e.g. from single cells, callus tissue or 
leaf discs, as is standard in the art. Almost any plant 
can be entirely regenerated from cells, tissues and 
organs of the plant. Available techniques are reviewed 
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in Vasil eC al . , Cell Culture and Somatic Cell Genetics 
of Plants, Vol I, II and III, Laboratory Procedures and 
Their Applications , Academic Press, 1984, and Weissbach 
and Weissbach, Methods for Plant Molecular Biology, 
5 Academic Press, 1989. 

The particular choice of a transformation technology 
will be determined by its efficiency to transform certain 
plant species as well as the experience and preference of 
the person practising the invention with a particular 

10 methodology of choice. It will be apparent to the skilled 

person that the particular choice of a transformation 
system to introduce nucleic acid into plant cells is not 
essential to or a limitation of the invention, nor is the 
choice of technique for plant regeneration. 

15 A TTGl gene and modified , versions thereof (alleles, 

mutants, variants and derivatives thereof) , may be used 
to affect a physical characteristic, such as hairs on 
roots and aerial parts of plants and anthocyanin pigments 
characteristics, in plants. For this purpose nucleic 

20 acid such as a vector as described herein may be used for 

the production of a transgenic plant. Such a plant may 
possess an altered phenotype as described above compared 
with wild-type (that is to say a plant that is wild-type 
for TTGl or the relevant homologue thereof) . 

25 The invention further encompasses a host cell 

transformed with nucleic acid or a vector according to 
the present invention, especially a plant or a microbial 
cell. Thus, a host cell, such as a plant cell, including 
heterologous nucleic acid according to the present 

30 invention is provided. Within the cell, the nucleic acid 

may be incorporated within the chromosome. There may be 
more than one heterologous nucleotide sequence per 
haploid genome . 

Also according to the invention there is provided a 

35 plant cell having incorporated into its genome nucleic 

acid, particularly heterologous nucleic acid, as provided 
by the present invention, under operative control of a 
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regulatory sequence for control of expression. The 
coding sequence may be operably linked to one or more 
regulatory sequences which may be heterologous or foreign 
to the gene, such as not naturally associated with the 
5 gene for its expression. The nucleic acid according to 

the invention may be placed under the control of an 
externally inducible gene promoter to place expression 
under the control of the user. 

A suitable inducible promoter is the GST- 11-27 gene 

10 promoter which has been shown to be induced by certain 

chemical compounds which can be applied to growing 
plants. The promoter is functional in both 
monocotyledons and dicotyledons. It can therefore be 
used to control gene expression in a variety of 

15 genetically modified plants, including field crops such 

as canola, sunflower, tobacco, sugarbeet, cotton; cereals 
such as wheat, barley, rice, maize, sorghum; fruit such 
as tomatoes, mangoes, peaches, apples, pears, 
strawberries, bananas, and melons; and vegetables such as 

20 carrot, lettuce, cabbage and onion. The GST- II -27 

promoter is also suitable for use in a variety of 
tissues, including roots, leaves, stems and reproductive 
tissues. 

A further aspect of the present invention provides a 
25 method of making such a plant cell involving introduction 

of nucleic acid or a suitable vector including the 
sequence of nucleotides into a plant cell and causing or 
allowing recombination between the vector and the plant 
cell genome to introduce the sequence of nucleotides into 
30 the genome. The invention extends to plant cells 

containing nucleic acid according to the invention as a 
result of introduction of the nucleic acid into an 
ancestor cell. 

The term "heterologous" may be used to indicate that 
35 the gene/sequence of nucleotides in question have been 

introduced into said cells of the plant or an ancestor 
thereof, using genetic engineering, i.e. by human 
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intervention. A transgenic plant cell, i.e. transgenic 
for the nucleic acid in question, may be provided. The 
transgene may be on an extra-genomic vector or 
incorporated, preferably stably, into the genome. A 
5 heterologous gene may replace an endogenous equivalent 

gene, i.e. one which normally performs the same or a 
similar function, or the inserted sequence may be 
additional to the endogenous gene or other sequence. An 
advantage of introduction of a heterologous gene is the 

10 ability to place expression of a sequence under the 

control of a promoter of choice, in order to be able to 
influence expression according to preference. 
Furthermore, mutants, variants and derivatives of the 
wild-type gene, e.g. with higher or lower activity than 

15 wild-type, may be used in place of the endogenous gene. 

Nucleic acid heterologous, or exogenous or foreign, to a 
plant cell may be non-naturally occurring in cells of 
that type, variety or species. Thus, nucleic acid may 
include a coding sequence of or derived from a particular 

20 type of plant cell or species or variety of plant, placed 

within the context of a plant cell of a different type or 
species or variety of plant. A further possibility is 
for a nucleic acid sequence to be placed within a cell in 
which it or a homologue is found naturally, but wherein 

25 the nucleic acid sequence is linked and/or adjacent to 

nucleic acid which does not occur naturally within the 
cell, or cells of that type or species or variety of 
plant, such as operably linked to one or more regulatory 
sequences, such as a promoter sequence, for control of 

30 expression. A sequence within a plant or other host cell 

may be identifiably heterologous, exogenous or foreign. 

Plants which include a plant cell according to the 
invention are also provided, along with any part or 
propagule thereof, seed, selfed or hybrid progeny and 

35 descendants. A plant according to the present invention 

may be one which does not breed true in one or more 
properties. Plant varieties may be excluded, 
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particularly registrable plant varieties according to 
Plant Breeders' Rights. It is noted that a plant need 
not be considered a "plant variety" simply because it 
contains stably within its genome a transgene, introduced 
5 into a cell of the plant or an ancestor thereof. 

In addition to a plant, the present invention 
provides any clone of such a plant, seed, selfed or 
hybrid progeny and descendants, and any part of any of 
these, such as cuttings, seed. The invention provides 

10 any plant propagule, that is any part which may be used 

in reproduction or propagation, sexual or asexual, 
including cuttings, seed and so on. Also encompassed by 
the invention is a plant which is a sexually or asexually 
propagated off-spring, clone or descendant of such a 

15 plant, or any part or propagule of said plant, off- 

spring, clone or descendant. 

The invention further provides a method of 
influencing or affecting a physical characteristic e.g. 
hairs on roots and aerial parts of plants and/or the 

20 presence or absence of anthocyanin pigment, including 

causing or allowing expression of a heterologous nucleic 
acid sequence as discussed within cells of the plant. 

The invention further provides a method including 
expression from nucleic acid encoding the amino acid 

25 sequence of Figure 3, or a mutant, variant, allele or 

derivative of the sequence, within cells of a plant 
(thereby producing an encoded polypeptide) , following an 
earlier step of introduction of the nucleic acid into a 
cell of the plant or an ancestor thereof . Such a method 

30 may influence or affect a phenotypic characteristic of 

the plant, such as those mentioned above. This may be 
used in combination with any other gene, such as 
transgenes involved in any other phenotypic trait or 
desirable property. 

35 The present invention also encompasses the 

expression product of the nucleic acid sequence disclosed 
and methods of making the expression product by 
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expression from nucleic acid encoding therefor under 
appropriate conditions, which may be in appropriate host 
cells. Following expression, the product may be isolated 
from the expression system and may be used as desired, 
5 for instance in formulation of a composition including at 

least one additional component. 

Purified TTGl protein, or a variant thereof, e.g. 
produced recombinantly by expression from encoding 
nucleic acid therefor, may be used to raise antibodies 

10 employing techniques which are standard in the art . 

Antibodies and polypeptides comprising antigen-binding 
fragments of antibodies may be used in identifying 
homologues from other plant species. 

Methods of producing antibodies include immunising a 

15 mammal with the protein or a fragment thereof. 

Antibodies may be obtained from immunised animals using 
any of a variety of techniques known in the art, and 
might be screened, preferably using binding of antibody 
to antigen of interest. As an alternative or supplement 

20 to immunising a mammal, antibodies with appropriate 

binding specificity may be obtained from a recombinantly 
produced library of expressed immunoglobulin variable 
domains, e.g. using lambda bacteriophage or filamentous 
bacteriophage which display functional immunoglobulin 

25 binding domains on their surfaces; for instance see 

WO92/01047 . 

A further aspect of the present invention provides a 
method of identifying and cloning TTGl homologues from 
plant species other than Arabidopsis thaliana which 

30 method employs a nucleotide sequence obtainable from that 

shown in Figure 3 . Such a method may include the steps 
of preparing nucleic acid from plant cells under test, 
providing a nucleic acid molecule having a nucleotide 
sequence shown in Figure 3 or complementary to a nucleic 

35 acid sequence shown in Figure 3, contacting nucleic acid 

in said preparation with said nucleic acid molecule under 
conditions for hybridization of said nucleic acid 
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molecule to any said gene or homologue in said 
preparation, and identifying said gene or homologue if 
present by its hybridization with said nucleic acid 
molecule . 

5 Sequences derived from these may themselves be used 

in identifying and in cloning other sequences. The 
nucleotide sequence information provided herein, or any 
part thereof, may be used in a data-base search to find 
homologous sequences, expression products of which can be 

10 tested for ability to influence characteristics described 

above. These may have TTGl function or the ability to 
modify characteristics including hairs on roots and 
aerial parts of plants and anthocyanin pigments. 
Alternatively, nucleic acid libraries may be screened 

15 using techniques well known to those skilled in the art 

and homologous sequences thereby identified then tested 
for requisite functionality. 

Further, nucleotide sequences obtained from that 
shown in figure 3 may be used to isolate .TTGl homologous 

20 from other species of plants by techniques such as 

hybridization and polymerase chain reaction (PGR) . 

PGR techniques for the amplification/ identification 
of nucleic acid are described in US Patent No. 4,683,195. 
The nucleic acid sequence provided herein readily 

25 allows the skilled person to design PGR primers. 

Such oligonucleotide probes or primers, as well as 
the full-length sequence (and mutants, alleles, variants 
and derivatives) are also useful in identifying 
homologous sequences. Further, the present invention also 

30 extends to oligonucleotide probes or primers for 

amplification and/or identification which are obtainable 
by use of the sequence shown in Figure 3, optionally by 
selecting regions which are conserved with other 
sequences e.g. from the prior art. Alternatively it may 

35 be desirable to generate more specific primers by 

selecting regions of the TTGl which are not homologous to 
other proteins such as anil. 
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In some preferred embodiments, oligonucleotides 
according to the present invention that are fragments of 
the sequence shown in Figure 3, or any mutant, allele, 
variant, or derivatives thereof, are at least 10 
5 nucleotides in length, more preferably at least 15 

nucleotides in length, more preferably at least 20 
nucleotides in length. 

Such fragments themselves individually represent 
aspects of the present invention. 

10 Techniques corresponding to those above may also be 

used for ascertaining the genotype of mutant plants 
having altered phenotypes corresponding to TTGl 
activities (e.g which lack trichomes or anthocyanin) i.e. 
the probes and primers of the present invention can be 

15 used for diagnosing mutations in such plants, or as 

markers for these traits. 

As described above, the present invention also 
extends to nucleic acid encoding an TTGl homologue 
obtained using a nucleotide sequence derived from that 

20 shown in Figure 3. 

In certain embodiments, nucleic acid according to 
the present invention encodes a polypeptide which has 
homology with all or part of the amino acid sequence 
shown in Figure 3 , in the terms discussed already above 

25 (e.g. for length), which homology is greater over the 

length of the relevant part (i.e. fragment) (the relevant 
part being greater than 110 amino acids in length, 
preferably greater than 200 amino acids and even more 
preferably greater than 300 amino acids in length) than 

30 the homology shared between a respective part of the 

amino acid sequence of Figure 3 and the EST sequence, and 
may be greater than about 5% greater, more preferably 
greater than about 10% greater, more preferably greater 
than about 20% greater, and more preferably greater than 

35 about 3 0% greater. 

Similarly, nucleic acid according to certain 
embodiments of the present invention may have homology 
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with all or part of the nucleotide sequence shown in 
Figure 3, in the terms discussed already above (e.g. for 
length) , which homology is greater over the length of the 
relevant part (i.e. fragment) (the relevant part being 
greater than 350 nucleotide in length, preferably greater 
than 400 and even more preferably greater than 500 
nucleotide in length) than the homology shared between a 
respective part of the nucleotide sequence of Figure 3 
and may be greater than about 5% greater, more preferably 
greater than about 10% greater, more preferably greater 
than about 20% greater, and more preferably greater than 
about 30% greater. Thus, to exemplify with reference to 
one embodiment, nucleic acid may be provided in 
accordance with the present invention wherein the 
nucleotide sequence includes a contiguous sequence of 
about 3 50 nucleotides which has greater homology with a 
contiguous sequence of 350 nucleotides within the 
nucleotide sequence of Figure 3 than any contiguous 
sequence of 331 nucleotides of an EST sequence, 
preferably greater than about 5% greater homology, and so 
on . 

The provision of sequence information for the TTGl 
gene of Arabidopsis thaliana enables the obtention of 
homologous sequences from other plant species . In 
particular, it should be possible to easily isolate TTGl 
homologues from related, commercially important Brassica 
species (e.g. Brassica nigra, Brassica napus , Brassica 
campestris and Brassica oleracea) . Examples of 
homologues from Matthiola incana (ten week stock) , 
Nicotiana tobaccuin var Samsum (tobacco) and Gossypium 
hirsutum cv. Siokva 1-4 (cotton) are disclosed in the 
Examples below. 

Thus, included within the scope of the present 
invention are nucleic acid molecules which encode amino 
acid sequences which are homologues of TTGl of 
Arabidopsis thaliana. Homology may be at the nucleotide 
sequence and/or amino acid sequence level, as has already 
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been discussed above. A homologue from a species other 
than Arabidopsis thaliana encodes a product which causes 
a phenotype similar to that caused by the Arabidopsis 
thaliana TTGl gene, generally including the ability to 
5 influence a phenotypic characteristic, particularly a 

phenotypic characteristic as described above. In 
addition, mutants, derivatives or alleles of these genes 
may alter such characteristics compared with wild-type. 
TTGl gene homologues may also be identified from 

10 economically important monocotyledonous crop plants such 

as rice and maize. Although genes encoding the same 
protein in monocotyledonous and dicotyledonous plants 
show relatively little homology at the nucleotide level, 
amino acid sequences are conserved. Therefore it is 

15 possible to use public sequence databases to identify 

Arabidopsis , rice or maize cDNA clone sequences that were 
obtained in random sequencing programmes and share 
homology to the gene of interest, as has been done for 
flowering time genes isolated from Arabidopsis (e.g. CO; 

20 WO 96/14414) . 

Nucleic acid according to the invention may be used 
to modify the characteristics of a plant. This may be 
achieved by modification of expression of the nucleic 
acid according to the present invention or by interfering 

25 with the normal function of the protein encoded by the 

nucleic acid according to the present invention. For 
example, nucleic acid according to the present invention 
may be used to increase or decrease the number of 
trichomes on different organs to enhance their 

30 effectiveness. Further, it may be used to alter the 

pigmentation of seeds, leaves, flowers and fruit for UV 
protection and/or colour for presentation reasons or for 
ornamental plants. This may involve use of anti-sense or 
sense regulation, discussed further below. 

35 As noted above, other physical characteristics of 

plants may be affected by means of expression from 
nucleic acid according to the present invention. 
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Nucleic acid according to the invention, such as an 
TTGl gene or homologue, may be placed under the control 
of an externally inducible gene promoter to place the 
timing of altering the characteristics of the plant under 
the control of the user. An advantage of introduction of 
a heterologous gene into a plant cell, particularly when 
the cell is comprised in a plant, is the ability to place 
expression of the gene under the control of a promoter of 
choice, in order to be able to influence gene expression, 
and therefore characteristic modification, according to 
preference. Furthermore, mutants and derivatives of the 
wild- type gene, eg with higher or lower activity than 
wild- type, may be used in place of the endogenous gene. 

In the present invention, over -express ion may be 
achieved by introduction of the nucleotide sequence in a 
sense orientation. Thus, the present invention provides 
a method of influencing a physical e.g. a phenotypic 
characteristic described above such as, an increase or 
decrease in trichomes, characteristic of a plant, the 
method including causing or allowing expression of the 
product (polypeptide or nucleic acid transcript) encoded 
by heterologous nucleic acid according to the invention 
from that nucleic acid within cells of the plant. 

Down- regulation of expression of a target gene may 
be achieved using anti-sense technology or "sense 
regulation" ( "co- suppress ion" ) . 

In using anti -sense genes or partial gene sequences 
to down-regulate gene expression, a nucleotide sequence 
is placed under the control of a promoter in . a "reverse 
orientation" such that transcription yields RNA which is 
complementary to normal mRNA transcribed from the "sense" 
strand of the target gene. See, for example, Smith et 
al. (1988) Nature 334, 724-726; Zhang et al, (1992) The 
Plant Cell 4, 1575-1588, English et al . , (1996) The Plant 
Cell 8, 179-188- Antisense technology is also reviewed 
in Bourque, (1995), Plant Science 105, 125-149, and 
Flavell, (1994) PNAS USA 91, 3490-3496. 
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An alternative is to use a copy of all or part of 
the target gene inserted in sense, that is the same, 
orientation as the target gene, to achieve reduction in 
expression of the target gene by co-suppression. See, 
5 for example, van der Krol eC al . , (1990) The Plant Cell 

2, 291-299; Napoli at al . , (1990) The Plant Cell 2, 279- 
289; Zhang et al . , (1992) The Plant Cell 4, 1575-1588, 
and US-A-5, 231, 020 . 

The complete sequence corresponding to the coding 

10 sequence (in .reverse orientation for anti-sense) need not 

be used. For example fragments of sufficient length may 
be used. It is a routine matter for the person skilled 
in the art to screen fragments of various sizes and from 
various parts of the coding sequence to optimise the 

15 level of anti-sense inhibition. It may be advantageous 

to include the initiating methionine ATG codon, and 
perhaps one or more nucleotides upstream of the 
initiating codon. A further possibility is to target a 
conserved sequence of a gene, e.g. a sequence that is 

20 characteristic of one or more genes, such as a regulatory 

sequence . 

The sequence employed may be about 500 nucleotides 
or less, possibly about 400 nucleotides, about 300 
nucleotides, about 200 nucleotides, or about 100 

25 nucleotides. It may be possible to use oligonucleotides 

of much shorter lengths, 14-23 nucleotides, although 
longer fragments, and generally even longer than about 
500 nucleotides are preferable where possible, such as 
longer than about 600 nucleotides, than about 700 

30 nucleotides, than about 800 nucleotides, than about 1000 

nucleotides or more. 

It may be preferable that there is complete sequence 
identity in the sequence used for down- regulation of 
expression of a target sequence, and the target sequence, 

35 though total complementarity or similarity of sequence is 

not essential. One or more nucleotides may differ in the 
sequence used from the target gene. Thus, a sequence 
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employed in a down- regulation of gene expression in 
accordance with the present invention may be a wild-type 
sequence (e.g. gene) selected from those available, or a 
mutant, derivative, variant or allele, by way of 
insertion, addition, deletion or substitution of one or 
more nucleotides, of such a sequence. The sequence need 
not include an open reading frame or specify an RNA that 
would be translatable. It may be preferred for there to 
be sufficient homology for the respective anti-sense and 
sense RNA molecules to hybridise. There may be down 
regulation of gene expression even where there is about 
5%, 10%, 15% or 20% or more mismatch between the sequence 
used and the target gene . 

Generally, the transcribed nucleic acid may 
represent a fragment of an TTGl gene, such as including a 
nucleotide sequence shown in Figure 3, or the complement 
thereof, or may be a mutant, derivative, variant or 
allele thereof, in similar terms as discussed above in 
relation to alterations being made to an TTGl coding 
sequence and the homology of the altered sequence. The 
homology may be sufficient for the transcribed anti-sense 
RNA to hybridise with nucleic acid within cells of the 
plant, though irrespective of whether hybridisation takes 
place the desired effect is down-regulation of gene 
expression . 

Thus, the present invention also provides a method 
of influencing a characteristic of a plant such as any 
one of those described above, the method including 
causing or allowing anti-sense transcription from 
heterologous nucleic acid according to the invention 
within cells of the plant. 

The present invention further provides the use of 
the nucleotide sequence of Figure 3 or a fragment, 
mutant, derivative, allele, variant or homologue thereof 
for down-regulation of gene expression, particularly 
down -regulation of expression of an TTGl gene or 
homologue thereof, preferably in order to influence a 
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physical characteristic of a plant, especially a 
phenotypic characteristic such as an increase or decrease 
of trichomes on different organs and/or and increase or 
decrease in anthocyanin pigments . 
5 When additional copies of the target gene are 

inserted in sense, that is the same, orientation as the 
target gene, a range of phenotypes is produced which 
includes individuals where over-expression occurs and 
some where under-expression of protein from the target 

10 gene occurs. When the inserted gene is only part of the 

endogenous gene the number of under-expressing 
individuals in the transgenic population increases. The 
mechanism by which sense regulation occurs, particularly 
down- regulation, is not well -understood . However, this 

15 technique is also well -reported in scientific and patent 

literature and is used routinely for gene control. See, 
for example, van der Krol et al., (1990) The Plant Cell 
2, 291-229; Napoli et al . , (1990) The Plant Cell 2, 
279-289; Zhang et al , 1992 The Plant Cell 4, 1575-1588. 

20 Again, fragments, mutants and so on may be used in 

similar terms as described above for use in anti-sense 
regulation . 

Further options for down regulation of gene 
expression include the use of ribozymes, e.g. hammerhead 

25 ribozymes, which can catalyse the site-specific cleavage 

of KNA, such as mRNA (see e.g. Jaeger (1997) "The new 
world of ribozymes" Curr Opin Struct Biol 7:324-335, or 
Gibson & Shillitoe ( 1997) "Ribozymes : their functions and 
strategies form their use" Mol Biotechnol 7: 242-251.). 

30 Another option is the use of nucleic acids enconding 

non-functional or partially functional mutant proteins 
(e.g. encoded by the mutant alleles of the present 
invention, or as produced by mutagenesis) which, when 
expressed in a plant, may compete with functional TTGl 

35 proteins for e.g. receptors or other binding partners 

thereby reducing the effectiveness of those proteins. 

Thus, the present invention also provides a method 
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of influencing a phenotypic characteristic of a plant, 
the method including causing or allowing expression from 
nucleic acid according to the invention within cells of 
the plant. This may be used to suppress activity of a 
5 product with ability to influence a phenotypic 

characteristic as described above. Here the activity of 
the product is preferably suppressed as a result of 
under-expression within the plant cells. 

Also embraced within the present invention are 

10 untranscribed parts of the TTGl gene. Thus in a further 

aspect of the present invention there is disclosed a 
nucleic acid molecule encoding the promoter of the TTGl 
gene . Owing to the widespread presence of the TTGl 
transcript in the plant it is believed that this promoter 

15 is constitutive or essentially constitutive, and thus may 

have utility in producing constructs for the expression 
of genes in plants. Variant promoters having promoter 
activity are also embraced by the present invention. 
To find homologous promoters, or the minimal 

20 elements or motifs responsible for promoter activity, 

restriction enzyme or nucleases may be used to digest a 
nucleic acid molecule comprising the 5' region of Seq ID 
No 1, or mutagenesis may be employed, followed by an 
appropriate assay (for example using a reporter gene such 

25 as luciferase operably linked to the restricted 

sequence) . Methods for promoter identification may 
employed without burden by those skilled in the art in 
the light of the sequence data disclosed herein. Once 
characterised the promoters of the present invention may 

3 0 incorporated into vectors. 

Aspects and embodiments of the present invention 
will now be illustrated, by way of example, with 
reference to the accompanying figures. Further aspects 
and embodiments will be apparent to those skilled in the 

35 art. All documents mentioned in this text are 

incorporated herein by reference . 
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Brief description of the drawings 
Figure 1 

This shows A) a large scaled map (not to scale) of 
5 the TTGl region showing the relationship between probes 

used in the region. YAC end probes are indicated by 
boxes; and B) a fine scale map of the TTGl region 
determined by RFLP mapping. End probes generated from 
YACs are indicated as boxes.. The position of the cosmid 
10 g4556 is shown in relation to the genomic lambda isolated 

from this region. The recombination points are marked 
with a cross. 

Figure 2 

15 This shows a map of genomic clones used to 

complement ttgl mutants . It shows the genomic fragments 
that have been used to complement the ttgl mutant 
phenotype. Fragment (a) represents the genomic 13.8kb 
insert in pB8 with the EcoRI (E) and Xbal (X) restriction 

20 sites some of which are used to create the deletion. 

Fragment (b) represents pBSDXl, (c) represents pB8DE2 and 
(d) represents pB8DE3 . Deletions are indicated by the 
dotted lines. Fragments (a) and (d) both gave transformed 
plants with trichomes and anthocyanin. Trans formants of 

25 (b) and (c) lacked. both trichomes and anthocyanin. 

Figure 3A and 3B 

This shows the sequence of TTGl locus (SEQ ID No. 
1) . The intron (coding sequence) is in italics and the 
30 predicted amino acid sequence (SEQ ID No. 2) is shown by 

the single letter code under the nucleotide sequence. 
Five identified mutations are shown (ttgl. 10, ttgl. 19, 
ttgl. 20, ttgl.l, ttgl. 9 - SEQ ID No's. 3 to 7 
respectively). Sixty bases are shown per line. 

35 

Figure 4 

This shows an alignment of TTGl, ANll from petunia 
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and the partial sequences from Matthiola (Seq ID No 8) 
and tobacco (Seq ID Nos 9 and 10) . 



Figure 5 

5 Shows the predicted cDNA sequence of TTGl. This 

corresponds to the region shown in capitals in Fig 3, 
plus a further 10 nucleotides- which were subsequently- 
mapped to the end of the transcript by primer extension 
studies . 

10 

SumTnary of sequence ID Nos 

1: Full length TTGl DNA sequence shown in Fig 3, 

including promoter region, full cDNA sequence, and coding 

sequence (which is aligned with the amino acid sequence) . 
15 2 : TTGl amino acid sequence in Fig 3 and 4 . 

3: DNA sequence ttgl.lO 

4: DNA sequence ttgl.l9 

5: DNA sequence ttgl.l 

6: DNA sequence ttgl.20 
20 7: DNA sequence ttgl.9 

8 : Partial amino acid sequence of Matthiola TTGl 

homologue 

9: Partial amino acid sequence of first tobacco TTGl 
homologue . 

25 10: Partial amino acid sequence of second tobacco TTGl 

homologue . 

11 and 12: Degenerate primers for cloning TTGl homologues 
(see Examples below) . 

13: DNA sequence of ttgl.lS, ttgl.l6, ttgl.17 and 
30 ttgl.lB. 



Detailed description. 

Molecular Mapping of ttcrl . 1 

Recombinants between the ttgl.l and MSI loci 
35 generated in a cross between Landsberg erecta carrying 

ttgi and msl and Ws ecotypes were analysed using RFLPs 
(restriction fragment length polymorphism) between these 
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parents with probes already mapped to this region by Nam 
et al (Nam et al (1989) Plant Cell 1 699-705.). 
■Recombinants on the distal side of ttgl.I were selected 
from a cross of Landsberg erecta carrying the ttgl . 1 
5 mutation and ga3 and ch5 and the RLDl ecotype which was 

wildtype for these loci. End probes from YACs that had 
been mapped to the region between msl and ga3 (Schmidt et 
al (Schmidt et al (1997) Plant J 11 563-572)) were also 
utilized to map the location of ttgl. RFLPs generated by 

10 the cosmid g4556 could not be separated from ttgl with 

the exception of one recombinant called Dennis 19 (on the 
msl side of ttgl) suggesting that g4556 was very close to 
the mutation in ttgl.l. A YAC EG20H2 that hybridised to 
g4556 and the cosmid were used to isolate overlapping 

15 genomic lambda clones. The lambda clones were ordered 

using restriction mapping and hybridization techniques 
and then used as probes for RFLPs amongst the 
recombinants . Figure 1 shows the large scale and fine 
scale maps from the TTGl region that were derived from 

20 the analysis of the recombinants on both sides of the 

ttgl.l mutation. In Figure IB the position of several 
recombination events between msl and ttgl have been 
indicated. On the distal side of ttgl no nearby 
recombination events could be mapped due to lack of RFLPs 

25 between the ecotypes used and lack of success in 

isolating clones from this region from three different 
libraries in lambda or cosmid vectors using a variety of 
probes . 

30 Complementation of the ttgl mutation 

The complete genomic inserts from overlapping lambda 
clones I.IA, 3.1A, 8 and X5 marked in Figure IB were 
subcloned into pBinNOT using the flanking NotI 
restriction sites in the lambda vector, giving a pB 

35 series of binary vectors. These were transferred into 

Agrobac Cerium tumefaciens strain AGLl (Lazo et al (1991) 
Biotechnology 9 963-967) and then used to transform 



wo 99/00501 



PCT/GB98/01861 



34 

Arabidopsis ecotype Landsberg erecta carrying the ttgrl.l 
mutation by co-cultivation with root explants (Valvekens 
et al (1988) Proc Natl Acad Sci USA 85 5536-5540) . Only 
a small number of kanamycin-resistant transf ormants were 
5 obtained, but one plant from pB8 , derived from lambda 8, 

had trichomes but failed to set seed. Transformants from 
pBl . lA did not have trichomes. Other kanamycin-resistant 
shootlets appeared to be escapes due to prolonged 
exposure to kanamycin in the callus stage. 

10 Several deletions were made of pB8 utilizing 

restriction sites within the genomic sequence and the 
polylinker of the vector. These deletion constructs 
(shown in Figure 2) were used to transform Arabidopsis 
ecotype Columbia carrying the ttgl.9 mutation via vacuum 

15 infiltration (Bechtold et al (1993) Compt Rend Acad Sci 

Ill-Life Sci 316 1194-1199) . Sixty transformants from 
pB8DE3 (indicated as construct d) produced trichomes, 
although one transformant from pB8DE3 showed kanamycin 
resistance but had no trichomes . The transformants 

20 bearing trichomes exhibited other wild-type 

characteristics of brown seed, seed mucilage, purple 
colouring of the plant and normal root hair numbers 
indicating that the other ttgl mutant phenotypes had also 
been complemented. Thirty-six kanamycin-resistant 

25 transformants from pB8DE2 (construct c) and 19 

transformants from pB8DXl (construct b) failed not 
produce trichomes, suggesting that TTGl was located in 
the regions deleted in these constructs . 

The positional cloning of the TTGI locus has 

30 provided information about the order of, and distances 

between, a number of RFLP markers which may be used to 
isolate nearby genes. This information is complementary 
to the data given in the physical maps of the region 
(Schmidt et al . , 1997 Plant J 11: 563-572; Thorlby et 

35 al., 1997 Plant J 12, 471-479). Although the present 

inventors analysed about 400 recombinants within a 14 map 
unit region, they were still unable to find breakpoints 
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very close to the TTGl locus. This suggests that 
recombination rates are reduced close to this gene. 
Recombination frequencies are known to vary along 
chromosomes in many species (Lichten and Goldman, 1995 
5 Annu Rev Genet 29: 445-476 for a review). 

Subclonincr of pB8DE3 and secaiencing 

Restriction fragments of the insert in pB8DE3 were 
ligated into pBluescript, sequenced using fluorescent 

10 dideoxynucleotides and the sequences compiled and 

analysed using the GCG package. Sequence analysis with 
Genmark and Netgene revealed two possible genes in the 
5777 bp insert in pB8DE3 . 

One of these genes revealed no consistent homology 

15 to any known protein, was not similar to any EST clone in 

data-bases, and did not appear to code a long ORF. 

The other predicted gene corresponded to an 
Arabidopsis EST (F20055, F20056) , indicating that the 
gene was functional and expressed. The predicted protein 

20 sequence of 341 amino acids shows sequence similarity 

(about 45 %) to a large and diverse group of proteins 
with WD40 repeat motifs. There are seven WD40 repeats 
with a short N-terminal region. The first two repeats 
contain a proline-rich region - the second repeat having 

25 8/23 amino acids which are proline. Three possible TATA 

boxes have been identified 133, 189 and 216 bases 
upstream of the predicted start of translation. 
Comparison of the genomic and EST sequences indicated the 
presence of a single intron 3' of the termination codon. 

3 0 The sequence including the promoter region is shown in 

Figure 3 . Primer extension experiments indicated that 
the start of transcription is 109 bases 5' of the start 
of translation (i.e. 23 bases from a TATA box) . 



35 



Sequence analysis of ttcrl mutants 

The present inventors examined the nucleotide 
sequence of this region of a number of ttgl mutant 
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alleles to determine whether the gene encoding the WD40 
repeat protein was likely to be the TTGl locus. PGR 
products from the region were generated with primers 
designed to give overlapping fragments of about 70 0 bp. 
5 These PGR products were obtained from the ttgl mutants 

ttgl.9, ttgl. 10, ttgl. 19 and ttgl.l and from their 
parental wild- type alleles using genomic DNA as the 
template. The PGR products were gel -purified and 
sequenced using both oligonucleotides designed as primers 

10 for PGR. 

Four of the mutant alleles contained point mutations 
that would result in changes to the protein product 
(Figure 3) . Point mutations in ttgl. 19 and ttgl.l 
resulted in the introduction of stop codons at codons 183 

15 and 317, respectively. 

A point mutation in ttgl, 20 resulted in the change 
of serine to cysteine at codon 30. This allele, plus 
also alleles ttgl. 15-18 contained a premature stop codon 
at position 310. 

20 A mutation in ttgl.9 resulted in the change of 

serine to phenylalanine at codon 282. 

The mutant allele ttgl. 10 contained a point mutation 
(G to A) in the 5' untranslated leader sequence, which 
may affect the translation of the RNA. These changes in 

25 the gene encoding the WD40 repeat protein confirm its 

identity as TTGl. 

Several of the mutants have a lower level of 
transcript detected by northern analysis and compared 
with message levels from the wildtype parent (results not 

3 0 shown) . The reduction in message level in mutants such as 

ttgl- 9 could be due to nonsense- mediated mRNA decay which 
has been shown to occur in plants as well as in other 
organisms (Dickey et al . , 1994 Plant Cell 6, 1171-1176; 
van Hoof and Green, 1996 Plant J 10: 415-424). 

35 

In summary the phenotypes (all were ttgl-like) and 
mutations are as follows: 
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Nasc No 


Parent 


Nasc DecriDtion 


Allele 


Mutation 


N300 


An-1 


pale, branched 


ttgl . 


. J.D 


s310 -> * 


N319 


En-2 


dwarf 


ttgl . 


. 16 


s310 -> * 


N3 3 9 


En-2 


pale 


ttgl . 


. 17 


s3l0 -> * 


N3 72 


En-2 


upright rosette 


ttgl . 


. 16 


s310 -> * 


N4 06 


En-2 




ttgl. 


.19 


W183 -> * 


N420 


En-2 


early flowering 


ttgl. 


.20 


s30 -> c 












s310 -> * 


N447 


En-2 


dwarf 


ttgl. 


. 1 


(as ttgl.l) 



10 

Effect of TfGl on stomata of Arabidopsis 



TTGl effect on stomata appears to be analogous to 
control of root hairs (Berger et al, 1998, Dev Biol 194 
15 226-234) . 

The table below shows a comparison between the 
stomatal numbers on hypocotyls in ttgl mutants compared 
to wild- type in air and at elevated CO2 concentrations. 

20 

Air CO2 
Landsberg erecta: 23.2+3.2 8.5±1.9 

ttgl mutant: 22.8±1.4 16.0+1.6 

25 Analysis of the expression of the TTGl gene 



To determine the length of the TTGl transcript and 
to see if the expression of the gene was confined to some 
organs, an RNA blot was hybridized with a TTGl probe. The 

30 resulting band was measured to be 1.35 kb in length and 

present in all organs tested (roots, rosette leaves, leaf 
buds, stems, cauline leaves, siliques, flowers, floral 
buds) . A surprise result was that it was highly expressed 
in floral meristems where there are only a few trichomes 

35 on the sepals and no anthocyanin in the flower petals. 



The" high level of transcripts of the TTGl locus 
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suggests two possible points of regulation of the gene. 
The first is that the TTGl protein is present in many 
tissues where it requires a partner for activation. 
Another possibility is that regulation of this gene 
occurs a posttranscriptional stage with protein only 
being present in those cells that require functional TTGl 
protein . 

Structure and function of the TTGl locus 

The present inventors have identified the TTGl locus 
as a gene encoding a WD40 repeat protein by 
complementation of a ttgl mutant with genomic DNA and by 
sequencing the gene in several ttgl mutant alleles. Two 
of the mutant alleles contained stop codons that would 
result in the production of truncated proteins, and two 
others contained point mutations that would change serine 
residues, to a cysteine and phenylalanine residues. The 
TTGl protein bears no resemblance to the maize R gene 
product which was able to complement the ttgl mutant, 
phenotype in Arabidopsis and anthocyanin pigment in 
tobacco flowers (Lloyd et al (1992) Science 258 1773- 
1775) . This suggests that TTGl acts upstream of an 
Arabidopsis R homologue in the pathways leading to 
trichome differentiation and anthocyanin synthesis. The 
involvement of other WD40 proteins in signal transduction 
pathways (see below) suggests that TTGl is involved in a 
pathway, or pathways, regulating the expression or action 
of downstream transcription factors. 

Computer modelling of TTGl 

WD4 0 repeat proteins are involved in a number of 
different types of regulatory roles, such as signalling 
(eg. Gfl subunit of heterotrimeric G proteins) , cell cycle 
regulation (eg CDC20 and CDC4) , transcriptional 
repression (eg yeast TUPl, Drosophila extra sex combs), 
vesicular trafficlcing (eg SEC13) and RNA processing (Neer 
et al (1994) Nature 371 297-300) . The TTGl protein shows 
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the highest sequence similarity to Gg subunits, which are 
the best characterised of the WD40 repeat proteins. The G^ 
subunit contains 7 repeats of the WD4 0 motif and has a 
structure resembling a seven-bladed propeller, based on 
5 its crystal structure with its partner (Sondek et al 

(199 6) Nature 3 79 3 69-3 74) . Each blade is composed of 4 
]S-sheets, and an N-terminal amphipathic a-helix interacts 
closely with the G^ subunit which is required for correct 
folding and function of the G^ subunit . However, computer- 

10 aided modelling of TTGl (in collaboration with N. 

Srinivasan and T.L. Blundell) suggests that proline-rich 
regions of the first two WD40 repeats may disrupt the 
folding of the ;S-sheets essential for the structure of 
each blade. In addition, amino acid residues identified 

15 as interacting with G„ and G^ subunits are not well 

conserved in TTGl. This may suggest that TTGl represents 
a separate class of WD4 0 repeat protein. Genes encoding 
several WD40 proteins, including G^ subunits and the COPl 
protein, have been isolated from plants. (Ma (1994) Plant 

20 Mol Biol 26 1611-1634) , but none of these proteins 

closely resembles TTGl other than anil. 

Thus studies of other WD4 0 proteins make it probable 
that the TTGl protein does not act directly as a 
transcription factor but binds to other proteins to 

25 promote the initiation of trichomes in leaves and stems. 

The TTGl protein may act as part of a DNA binding complex 
to regulate transcription. However the amino acid 
sequence contains no recognizable nuclear localization 
signal from computer analysis, although a cryptic site 

30 might be present. Another possibility is that another 

protein is required to form a complex for nuclear import 
as is the case with AP3 and PI from Arabidopsis 
(McGonigle et al . , 1996 Genes Dev 10, 1812-1821). Another 
possibility is that the TTGl protein is only located in 

35 the cytoplasm and acts as part of a signal transduction 

pathway. GUS-TTGl and TTGl-GFP fusion proteins appear to 
be -cytoplasmically located. 
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There are several sequences of unknown function that 
show higher similarity to TTGl than the subunit from 
either plants or animals. One sequence is the result of a 
genomic sequencing project in Arabidopsis and has a 85% 
similarity to TTGl and is located on chromosome 3 . Gel 
blots hybridized and washed at low stringency show at 
least three bands in Arabidopsis and maize suggesting 
that TTGl could belong to a class of proteins. Two 
C.elegans genes arrayed in tandem have greater similarity 
to TTGl than any locus from yeast Saccaromyces cerevici. 
This is surprising as yeast is more closely related to 
Arabidopsis than C.elegans is. However if TTGl plays a 
role in defining functions in epidermal cells, this 
function may also be required in other multicellular 
organisms . 

Transcription factors like those from maize and 
Antirrhinum have been identified in Petunia where they 
regulate flower colour (reviewed in Mol et al . , 1996). 
JAF13 is similar to the J? gene and Delila and AN2 encodes 
a MYB factor like CI from maize. These are thought to act 
together in a similar way to R and CI to positively 
regulate the anthocyanin pathway. ANll from Petunia 
controls anthocyanin biosynthesis in flowers (de Vetten 
et al., 1997 Genes Dev ii, 1422-1434) possibly by 
regulating AN2 . This is contrary to the evidence that 
TTGl might regulate MYC transcription factors, determined 
by overexpression of the maize R gene in Arabidopsis 
(Lloyd et al.,.1992 Proc Nat Acad Sci USA 86: 7092-7096). 
The identification of two WD40 repeat proteins which 
regulate anthocyanins and, in the case of the TTGl 
protein, many other pathways suggests that this class of 
protein may be involved in regulating developmental 
pathways in other organisms . 

Cloning of the Matthiola incana , Nicotiana tobaccum and 
cotton homologues 

Primers for degenerate PGR were designed by 
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comparison of the TTGl sequence with the anil sequence 
(see Fig 4) . Primers were based on the sequence encoding 
amino acids 74-85,. and 296-307. 

5 Primer sequences were: 

5' (TL3) TTYGAICAYCCITAYCCICCIACIAARYTIATGTT (Seq ID 

No 11) 

3' (TL4) CATIGGRTGRATICCRTTIGGICCIGCIACIGTNGG (Seq ID 

10 No 12) 

Amplification conditions were ascertained using a 
temperature gradient in a Robocycler (Stratagene) . Ing 
of genomic DNA template was amplified with 5pmol of each 
15 primer using Taq DNA polymerase (Qiagen) with the 

addition of QX (Qiagen) for stabilising the DNA-primer 
complex . 

Once an annealing temperature of 42°C was 
established, genomic DNA from Matthiola . incana (ten week 

20 stock) . Nicotiana toJbaccuni var Samsum (tobacco) and 

Gossypium hirsutum cv. Siokva 1-4 (cotton) was used as a 
template in 50ul reactions with the temperature ramped 
between annealing and extension to 15 degrees per minute. 
The amplified bands were size- f ractioned and 

25 extracted from a gel. The Matthiola DNA was polished 

with Klenow enzyme to make a blunt end ligated into the 
EcoRV site of pBluescript. Three independent constructs 
were sequenced. The gel purified PGR product was used as 
a probe on a Southern blot to verify that the product 

3 0 originated from Matthiola. At high stringency (65°C in 

O.lXSSC+1% SDS) the PGR product cross hybridises to the 
TTGl gene of Arabidopsis. 

Tobacco sequences (tobacco 1 and 2) were obtained 
from two constructs (pTOBl and pT0B2) using T- vectors 

3 5 bases on pBluescript. Each corresponds to one of the 

genomic sequences found in tobacco which is an 
allotetraploid species . The two sequences are 95% 
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identical at both the nucleotide and the amino acid 
level. pTOBl hybridises to both tobacco genes but only- 
weakly to TTGl at high stringency. The cotton gene is 
currently being sequenced. 

Garden blots reveal the presence of several similar 
sequences in Arabidopsis, tobacco. Petunia and zea mays 
when hybridised at 50°C in 5M NaCl and washed at 50°C in 
IXSSC+1% SDS. 

Use of TTGl ant i -sense constructs 

The insert in construct pTOBl was removed using Sac I 
and EcoRV and ligated into the Sad and Smal sites in the 
pROK2 vector. This gives an antisense construct with the 
3 5S promoter driving a transcript from the complementary 
strand of the TOBl-TTGl gene. Constructs containing the 
TOBl sequence may be placed into the Agrobacterium strain 
LBA4404 for transfer into tobacco plants. 

General methods 

General methods were perfomed in accordance with 
Sambrook et al (1989) discussed above. 

Plant material 

Ecotypes Landsberg erecta, Columbia, RLDl, Ws 
(Wassilewskija) were supplied by the Nottingham 
Arabidopsis Stock Centre. The Landsberg erecta line 
carrying msl and ttgl.l was from . The line containing 
ttgl.l ga3 ch5 in Landsberg erecta background was a gift. 

Growing plants 

a) for crosses and seed in the 'Arabicon system', 3:2:1 
soil vermiculite perlite was used with 16 hours light, 

b) for material in trays, 12 hours light was used. 

c) for transformation, peat-based soil with 5-10 plants 
in 4 inch pots in a glasshouse with supplementary 
lighting was used. Pots were covered in muslin, and seed 
mixed with sand was sprinkled on top. Plants were thinned 
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to 10 per pot at 2-3 weeks. Bolts were cut back once and 
allowed to reemerge for several days before infiltration. 
All three types of plants received a weekly feed of macro 
nutrients. 

5 d) in culture 1/2 MS + 0.8% agar 16 hours light was used, 

e) for root material, plants were grown on plates 
containing 1/2MS +1.2% phytogel in a nearly vertical 
position. 

10 Searching the Kranz collection for more ttql alleles. 

Candidates described as glabrous and having yellow 
seeds were grown and crossed to ttgl.l mutants. The F2 
generation was examined for segregating phenotypes . All 
were examined for seed mucilage, anthocyanin in the plant 

15 aiid the testa and for leaf hairs. . . 

DNA extractions 

Plant material was treated as in Dellaporta et 
al. (1983) Plant Mol Biol Rep 1,4, 19-21, followed by CsCl 
20 banding to remove RNA and polysaccharides (Walker et al, 

1997 Photosyn Res 54, 155-163) so as to be able to detect 
small band shifts on DNA gel blots. 

Library screens 

25 Genomic library in lambda Dashll (Stratagene) from 

Landsberg erecta (Boyce et al, 1994 Plant Physiol 106, 
1691) distributed by EEC-BRIDGE Arabidopsis DNA Stock 
Centre. 

Construction of the pBINNOT vector 

The pBIN19 vector was modified to contain a NotI 
site in the polylinker. To remove the original NotI site, 
pBIN19 was restricted with NotI, treated with Klenow and 
dNTPs to fill in the site, ligated in a large volume, 
restricted again with NotI and transfected into E.coli 
strain TGI. Plasmid DNA was isolated from resulting 
colonies to check that the original NotI site no longer 



30 



35 
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existed. The vector was restricted with Xbal and Asp718 . 
Two annealed oligonucleotides (NX: GTACCGCGGCCGCAT AND 
N2: CTAGATGCGGCCGCG) containing a NotI site were ligated 
into the vector to reconstitute the Xbal and Asp718 
sites. The ligated DNA was restricted with BamHI to 
remove parental molecules. In effect, the BamHI site in 
the polylinker of pBIN19 has been replaced with a unique 
NotI site and the altered vector called pBINNOT. 

Plant transformation 

Agrobacterium strain Agll (Lazo et al . , 1991 
Biotechnology 9, 963-967) was transfoirmed with constructs 
based on pBINNOT vector by electroporation . Using vacuum 
infiltration (Bechtold et al , 1993 Compt Rend Acad Sci 
III Life Sci 316, 1194-1199) . DNA containing genomic 
fragments were introduced into ttgl.9 mutant plants. 



wo 99/00501 



PCT/GB98/01861 



45 

Claims 

1. An isolated nucleic acid molecule encoding a 
polypeptide with TTGl function. 

2 . A nucleic acid as claimed in claim 1 wherein the 
polypeptide includes the amino acid sequence SEQ ID No. 2 

3 . A nucleic acid as claimed in claim 2 comprising a 
nucleotide sequence having SEQ ID NO 1 or the coding 
sequence shown therein 

4 . An isolated nucleic acid comprising a nucleic acid 
sequence which shares at least 50%; 60%; 70%; 80%; 90%; 
95%; 96%; 97%; 98%; 99% sequence identity with the 
nucleic acid of claim 2 or claim 3. 

5. A nucleic acid as claimed in claim 4 which is a 
mutant , variant, derivative of any one of the nucleic 
acid sequences of claim 2 or claim 3 by way of addition, 
insertion, deletion or substitution of one or more 
nucleotides in the nucleic acid. 

6 . A nucleic acid as claimed in claim 4 which is a TTGl 
homologue from a species other than Arabidopsis thaliana 

7. A nucleic acid as claimed in claim 6 which is a 
homologue from nicotiana or matthioia. 

8. A nucleic acid as claimed in claim 7 which encodes a 
polypeptide comprising amino acid Seq ID No 8, 9 or 10 . 



9. A nucleic acid as claimed in claim 4 which is an 
allele of the nucleic acid sequences of claim 2 or claim 
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10. A nucleic acid as claimed in any one of claims 4 to 
9 which encodes a polypeptide with TTGl function. 

11. A nucleic acid as claimed in any one of claims 1 to 
3 or claim 10 wherein the TTGl function comprises the 
ability to alter two or more of the following phenotypic 
characteristics of a plant into which said polypeptide is 
introduced: number of trichomes on the aerial parts of 
the plant; number of trichomes on the roots hairs; 
mucilage of the seeds; doirmancy of the seeds; anthocyanin 
pigmentation; condensation of the tannins; number of 
stomata on hypocotyls. 

12 . A nucleic acid as claimed in claim 9 which comprises 
the nucleotide sequence of an allele selected from: 
ttgl.lO, ttgl.l9, ttgl.l, ttgl.20, ttgl.9. 

13 . A nucleic acid including or consisting essentially 
of a sequence of nucleotides complementary to the 
nucleotide sequence of a nucleic acid as claimed in any 
one of the preceding claims . 

14 . A method of identifying a homologue or allele as 
claimed in any one of claims 6 to 9 which method employs 
a nucleotide sequence obtainable from that shown in Seq 
ID No 1, said method including the steps of: 

(i) preparing nucleic acid from plant cells under test, 

(ii) - providing a nucleic acid molecule which is a probe 
or primer having a nucleotide comprising all or part of a 
nucleotide sequence as claimed in claim 2 or claim 3, or 
complementary to that sequence 

(iii) contacting nucleic acid in said preparation with 
said probe or primer under conditions for hybridization, 
and 

(iv) identifying said gene or homologue if present by its 
hybridization with said nucleic acid molecule. 
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15 . A method as claimed in claim 14. further comprising 
the step of testing the homologue or allele for TTGl 
function. 



5 16 . A method as claimed in claim 14 or claim 15 wherein 

the plant cell nucleic acid is obtained from a plant 
species other than Arabidopsis thaliana . 

17. An oligonucleotide for use as a nucleic acid probe 
10 or primer for use in the method as claimed in any one of 

claims 14 to 16, said oligonucleotide comprising: 

(i) a nucleotide sequence encoding an amino acid sequence 
which is conserved between TTGl and the polypeptide 
encoded by the nucleic acid of any one. of claims 6 to 8, 

15 or 

(ii) a nucleotide sequence which is complementary to said 
sequence . 

.18 . An oligonucleotide as claimed in claim 17 comprising 
20 at least about 10; 15; 20; 25; 3 0 or 3 5 nucleotides in 

length . 

19 . A recombinant vector comprising the nucleic acid of 
any one of claims 1 to 13 . 

25 

20 . A vector as claimed in claim 19 wherein the nucleic 
acid is under the control of a promoter. 

21. A vector as claimed in claim 20 wherein the promoter 
30 is an inducible promoter. 

22 . A vector as claimed in any one of claims 19 to 21 
further comprising one or more of: a terminator sequence; 
a polyadenylation sequence; an enhancer sequence; a 

35 marker gene. 

23 . A host cell comprising the nucleic acid of any one 
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of claims 1 to 13 . 

24. A host cell transformed with nucleic acid of any one 
of claims 1 to 13 or a vector of any one of claims 19 to 
22- 

25. A host cell having incorporated into its genome 
heterologous nucleic acid as claimed in any one of claims 
1 to 13. 

26. A host cell as claimed in any one of claims 23 to 25 
which is a plant cell. 

27. A method of making the plant cell of claim 26, the 
method comprising the steps of: 

(i) introducing a vector as claimed in any one of claims 
19 to 22 into the plant cell, 

(ii) causing or allowing recombination between the vector 
and the plant cell genome to introduce a nucleic acid as 
claimed in any one of claims 1 to 13 into the genome . 

28. A plant which has been regenerated from the plant 
cell of claim 26. 

29. A plant as claimed in claim 28 including the plant 
cell of claim 26. 

30. A plant as claimed in claim 29 which is a clone; 
selfed or hybrid progeny, or other offspring or 
descendant of the plant of claim 28. 

31. A cutting, part, or seed or other propagule of a 
plant as claimed in any one of claims 28 to 30. 

32. A polypeptide expression product of any one of the 
nucleic acids of claims 1 to 11 which has TTGl activity. 
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33. A polypeptide as claimed in claim 32 comprising the 
amino acid sequence SEQ ID No 2 . 

34. A method of making the polypeptide of claim 32 or 

5 claim 33 by causing or allowing expression from a nucleic 

acid encoding the polypeptide, following an earlier step 
of introduction of the nucleic acid into a cell of a 
plant -or an ancestor thereof . 

10 35. Use of a polypeptide of claim 32 or claim 33 or a 

nucleic acid of any of claims 1 to 13 to regulate the 
expression or action of a transcription factor. 

36. Use of a polypeptide of claim 32 or claim 33 to 
15 raise an antibody. 

37. An antibody having specific binding affinity for the 
polypeptide claimed in claim 32 or claim 33. 

20 38. A polypeptide comprising the antigen-binding site of 

the antibody of claim 37. 

39. A method of influencing or affecting a physical 
characteristic of a plant comprising causing or allowing 

25 expression of a heterologous nucleic acid sequence as 

claimed in any one of claims 1 to 13 within the cells of 
the plant, following an earlier step of introducing the 
nucleic acid into a cell of the plant or an ancestor 
thereof . 

30 

40. A method as claimed in claim 39 wherein the nucleic 
acid is expressed under the control of an inducible 
promoter . 

35 41. A method for downwardy modulating the expression of 

a nucleic acid as claimed in any one of claims 1 to 12 in 
a plant, the method comprising any of the following: 
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(i) causing or allowing transcription from a nucleic acid 
as claimed in claim 13 in the plant; 

(ii) causing or allowing transcription from a nucleic 
acid as claimed in claims 1 to 12 or a part thereof in 
the plant such as to reduce expression by co- suppress ion; 

(iii) use of nucleic acid encoding a ribozyme specific 
for a nucleic acid as claimed in any one of claims 1 to 
12 . 

42 . A method of influencing any one or more of the 
following a phenotypic characteristics of a plant: 
insect protection; chemical production; climate 
tolerance; salt removal; fibre production; ornamental 
value; water and nutrient absorption; initiation of seed 
germination; pigmentation; taste; speed of seedling 
growth; the method comprising a method as claimed in any 
one of claims 39 to 41. 

43 . An isolated nucleic acid molecule comprising a 
sequence encoding the promoter sequence of the TGGl gene, 
or a mutant, variant, derivative or other homolog 
thereof . 

44 . A vector comprising the promoter sequence of claim 
43 . 
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Fig, 3 A 

agttatgtttatgcttttcatcatatttagtggttagtttttattatttatttattgatt 60 

catgacttatgctagattatgataagaatttatgttaccacttgataaatcctccatttg 120 

acatgtgtttaatgctagatttatattgtctccaaatttacaactttgatgtcttatgat 180 

aaatgccaacaaccaaatttcagataaagattagcagactaactaagcttattattcact 240 

tgcaaggtggagtgatgttgaaagaaccctcacagacacgtcattgggaagactaaatct 300 

ctttttagcacgttacacctttgagatcgcgtttattccatatggagagagagcaacaat 360 

acgagacatggagaggcaccattaccgccggcgcaactgcttccaaatattgacaaacaa 420 

atttgaatctggatcttctctattcgtgaacaaggagatagaagctacgatgaatgcatg 480 

gaagcttggtttgctttaatataaacactaaaggggagtagaactttcttgaaaaattgt 540 

cttcgaaccaaacgaaattatatttgtgatttcccctcatcttgaaagaactttttaaca 600 

atgcaaattatttaccgaatgttaaaagcttttttcgaataaattttacattttcttaat 660 

aataataataaaaaaggattgttgattatcttaatcacaaacaatttattttagctgaat 720 

tagacaattgttagtaaaatgattagagtgtcacatattaatgttgttagtgtttcatgt 780 

catcctagtgatccaataattaggccattctatagctcgtaacgttaaaataaaaggccc 840 

attatctgaatatacagaagcccattatcaatagatacattaaaagatactgattaatcc 900 

agagggtttatatctacgccgtctccattgattatttctccGTCTCTTGAAAAATCCGAC 960 

A in iztgl.lO 

TGACACTGACCTCAAAACTCTCCTCTCACTTTCGTCGTGAAGAAGCCAAATCTCGAATCG 1020 
AATCAGCACCACACATTTCCATGGATAATTCAGCTCCAGATTCGTTATCCAGATCGGAAA 1080 

MDNSAPDSLSRSET 

G in ttgl.20 

CCGCCGTCACATACGACTCACCATATCCACTCTACGCCATGGCTTTCTCTTCTCTCCGCT 1140 
AVTYDS PYPLYAMAFSSLRS 

CATCCTCCGGTCACAGAATCGCCGTCGGAAGCTTCCTCGAAGATTACAACAACCGCATCG 1200 
SSGHRIAVGSPLEDYNNRID 

ACATTCTCTCTTTCGATTCCGATTCAATGACCGTTAAGCCTCTCCCGAATCTCTCCTTCG 1260 
ILSFDSDSMTVKPLPNLSFE 

AGCATCCTTATCCTCCAACAAAGCTAATGTTCAGTCCTCCTTCTCTCCGTCGTCCTTCCT 1320 
HPYPPTKLMFSPPSLRRPSS 

CCGGAGATCTCCTCGCTTCCTCCGGCGATTTCCTCCGTCTTTGGGAAATTAACGAAGATT 1380 
GDLLAS SGDFLRLWEINEDS 

CATCAACCGTCGAGCCAATCTCGGTTCTCJ^CAACAGCAAAACGAGCGAGTTTTGTGCGC 1440 
STVEPI SVL NNSKTSEFCAP 

CGTTGACTTCCTTCGATTGGAAC6ATGTAGAGCC6AAACGTCTCGGAACTTGTAGTATTG 1500 
LTSFDWNDVEPKRLGTCSID 

ATACGACGTGTACGATTTGGGATATTGAGAAGTCTGTTGTTGAGACTCAGCTTATAGCTC 1560 
TTCTIWDIEKSVVETQLIAH 

A in ttgl.l9 

ATGATAAAGAGGTTCATGACATTGCTTGGGGAGAAGCTAGGGTTTTCGCATCAGTCTCTG 1620 
DKEVHD lAWGEARVPASVSA 

CTGATGGATCCGTTAGGATCTTTGATTTACGTGATAAGGAACATTCTACAATCATTTACG" 1680 
DGSVRIFDLRDKEHSTIIYE 



SUBSTITUTE SHEET (RULE 26) 



wo 99/00501 PCT/GB98/01861 



4/7 

Fig 3B 



AGAGTCCTCAGCCTGATACGCCTTTGTTAAGACTTGCTTGGAACAAACAAGATCTTAGAT 1740 
SPQPDTPLLRLAWNKQDLRY 

ATATGGCTACGATTTTGATGGATTCTAATAAGGTTGTGATTCTCGATATTCGTTCGCCGA 1800 
MATILMD SNKVVILD I RSPT 

CTATGCCTGTTGCTGAGCTTGAAAgACATCAGGCTAGTGTGAATGCTATAGCTTGGGCGC 1860 
MPVAELERHQASVNAIAWAP 

T in ttgl. 9 

CTCAGAGCTGTAAACATATTTGTTCTGGTGGTGATGATACACAGGCTCTTATTTGGGAGC 1920 
QSCKHICSGGDDTQAL IWEL 

TTCCTACTGTTGCTGGACCCAATGGGATTGATCCGATGTCGGTTTATTCGGCTGGTTCGG 1980 
PTVAGPNGIDPMSVYSAGSE 

T in ttgl. 21 

AGATTAATCAGTTGCAGTGGTCTTCTTCGCAGCCTGATTGGATTGGTATTGCTTTTGCTA 2040 
INQLQWSSSQPDWIGIAFAN 

ACAAAATGCAGCTCCTTAGAGTTTGAGg tgagag t ttctcttt cgctaca taattctcat 2100 
KMQLLRV* 

ttgctaggcctagattctaatgaggaagcattgattattggtttagattgtgttgcatta 2160 

cagatagttctctaggtttggtaactaaacgttttttcgattcttgataacaaagccact 2220 

agagatttgacactaactcgttttagatttacctgaatcaatatctctgttaaaatcaat 2280 

tactt tgtta tgca taca taaa tcacagt t tagtagtca ta ta ta t tggc tcttat tagc 2340 

gacaggtctcacacttgctgtaatggctgatagtgtagtagtcatatgttggctttcatc 2400 

taagt tga tg ta tea t a tga tgaa tagt tgtacac teg tcaggt tc taa 1 1 1 1 ta ccca t 2460 

aattcttcagtctatttttttttgagacaatctattcttaatttaacgaagccactagct 2520 

acgtatacaaatattgttaatttaacgaagtatctgagaattgtttactgctgactctgc 2580 

tgtatgccctcagaaacatatagaagtggaattggaaacttcatgctggtttgaacatct 2640 

t tgta tgtgtgc t tcaggt ttt tgtaactca 1 1 tagacaacagca t tgca ta tatacacg 2700 

caca ta tgcaacc tagaaaa tcaaa taacc 1 1 tec t ta taa t tac ta tcca 1 1 tcac t tg 2760 

a tgtcagGTGCAGATGTGAAGTGATCAATAAGGATTTTAGCATAGACCCGTATAATCGTC 2820 

ATGTGCGTAAGTAGGTTTGGTTTGCGCTCCCTCTCGCTTTTAGGTCCGCAATGACTCTGT 2880 

ATCTATCTGATTGTAACTAAAACTGAATTCATTTGATGAACCAAATGATACTATTATCTT 2940 

ATGTTGTgtataaaacccaaccaggatatattgcggtttctggtgtttagatttggtaat 3000 

tggagcttagtacaatgcaaccctgtcttgctttattggacgtctctaagataaatcagc 3060 
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1 50 
TTGl -MDNSAPDSL SRSETAVTYD SPYPLYAMAF SSLRSSSGHR lAVGSFLEDY 

Matthiola 

Tobaccol 

Tobacco2 

Petuniaanll MENSSQESQH LRSENSVTYD STYPIYSMAF SSF.PTPRRR lAVGSFIEEL 

51 100 
TTGl NNRIDILSFD SDSMTVKPLP NLSFEHPYPP TKLMFSPPSL RRPSSGDLLA 

Matthiola FEHPYPP TKLMFSPPSL RRPSGGDLLA 

Tobaccol FEHPYPP TKLMFHPNPS ASLKSNDILA 

TobacCo2 FEHPYPP TKLMFHPNPS ASLKSNDILA 

Petuniaanll NNRVELLSFN EETLTLNPIP NLSFDHPYPP TKLMFHPNPI KS-.NNDILA 



TTGl 
Matthiola 
Tobaccol 
Tobacco2 
Petuniaanll 



101 
SSGDFLRLWE 
SSGDFLRLWE 
SSGDYLRLWE 
SSGDYLRLWE 

SSGDYLRLWE 



INEDSSTVEP ISVLNNSKTS EFCAPLTSFD 
VSEDSSTVEP VSVLNNSKTS EFCAPLTSFD 
VRE..SSIEP LFTLNNSKTS EYCAPLTSFD 
VRE..SSIEP LFTLNNSKTS EYCAPLTSFD 
VKE..SSIEP LFTLNNSKTS EYCAPLTSFD 



150 

WNDVEPKRLG 
WNDVEPKRLG 
V7NEVEPRRIG 
WDEIEPKRIG 
WNEVEPKRIG 



TTGl 
Matthiola 
Tobaccol 
Tobacco 2 
Petuniaanll 



151 
TCSIDTTCTI 
TCSIDTTCTI 
TSSIDTTCTI 
TSSTDTTCTI 
TSSIDTTCTI 



WDIEKSWET 
WDIEKSWET 
WDVEKGWQT 
WDVEKGWET 
WDVEKGWET 



QLIAHDKEVH 
QLIAHDKEVH 
QLIAHDKEGY 
QLIAHDKEVY 
QLIAHDKEVY 



DIAWGEARVF 
DIAWGEARVF 
DIAWGEAGVF 
DIAWGEDGVF 
DIAWGEAGVF 



200 

ASVSADGSVR 
ASVSADGSVR 
ASVSADGSVR 
ASVSADGSVR 
ASVSADGSVR 



TTGl 
Matthiola 
Tobaccol 
Tobacco2 
Petuniaanl 1 



201 

IFDLRDKEHS 
IFDLRDKEHS 
IFDLRDKEHS 
IFDLRDKEHS 
IFDLRDKEHS 



TIIYESPQPD 
TIIYESPQPD 
TIIYESPQPD 
TIIYESPQPD 
TIIYESPTPD 



TPLLRLAWNK 
TPLLRLAWNK 
TPLLRVAWNK 
TPLLRLAWNK 
TPLLRLAWNK 



QDLRYMATIL 
QDLRYMATIL 
QDLRYMATIL 
QDLRYMATIL 
QDLRYMATIL 



250 

MDSNKWILD 
MDSNKWILD 
MDSNKNVILD 
MDSNKIVILD 
MDSNKWILD 



TTGl 
Matthiola 
Tobaccol 
Tobacco2 
Petuniaanll 



251 

IRSPTMPVAE 
IRSPTMPVAE 
IRSPAMPVAE 
IRSPAMPVAE 
IRSPAMPVAE 



LERHQASVNA 
LERHQASVNA 
LERHQASVNA 
LERHQASVNA 
LERHQASVNA 



lAWAPQSCKH 
lAWAPQSCKH 
lAWAPQSRRH 
lAWAPQSCRH 
lAWAPQSCRH 



ICSGGDDTQA 
ICSAGDDTQA 
ICSAGDDGQA 
ICSAGDDGQA 
ICSGGDDGQA 



300 

LIWELPTVAG 
LIWELPTVAG 
LIWELPTV — 
LIWELPTVAG 
LIWELPTVAG 



301 343 
TTGl PNGIDPMSVY SAGSEINQLQ WSSSQPDWIG lAFANKMQLL RV- 

Matthiola pngidp 

Tobaccol 

Tobacco2 

Petuniaanll PNGIDPMSMY SAGAEINQLQ WSPAQRDWIA lAFSNKLQLL KV* 



Fig 4 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 



TTATTTCTCC GTCTCTTGAA AAATCCGACT GACACTGACC TCAAAACTCT 
CCTCTCACTT TCGTCGTGAA GAAGCCAAAT CTCGAATCGA ATCAGCACCA 
CACATTTCCA TGGATAATTC AGCTCCAGAT TCGTTATCCA GATCGGAAAC 
CGCCGTCACA TACGACTCAC CATATCCACT CTACGCCATG GCTTTCTCTT 
CTCTCCGCTC ATCCTCCGGT CACAGAATCG CCGTCGGAAG CTTCCTCGAA 
GATTACAACA ACCGCATCGA CATTCTCTCT TTCGATTCCG ATTCAATGAC 
CGTTAAGCCT CTCCCGAATC TCTCCTTCGA GCATCCTTAT CCTCCAACAA 
AGCTAATGTT CAGTCCTCCT TCTCTCCGTC GTCCTTCCTC CGGAGATCTC 
CTCGCTTCCT CCGGC6ATTT CCTCCGTCTT TGGGAAATTA ACGAAGATTC 
ATCAACCGTC GAGCCAATCT CGGTTCTCAA CAACAGCAAA ACGAGC6AGT 
TTTGTGCGCC GTTGACTTCC TTCGATTGGA ACGATGTAGA GCCGAAACGT 
CTCGGAACTT GTAGTATTGA TACGACGTGT ACGATTTGGG ATATTGAGAA 
GTCTGTTGTT GAGACTCAGC TTATAGCTCA TGATAAAGAG GTTCATGACA 
TTGCTT6GGG AGAAGCTAGG GTTTTCGCAT CAGTCTCTGC TGATGGATCC 
6TTAGGATCT TTGATTTACG TGATAAGGAA CATTCTACAA TCATTTAC6A 
GA6TCCTCAG CCTGATACGC CTTT6TTAAG ACTTGCTTGG AACAAACAAG 
ATCTTAGATA TATGGCTACG ATTTTGATGG ATTCTAATAA GGTTGTGATT 
CTCGATATTC GTTCGCCGAC TATGCCTGTT GCTGAGCTTG AAAgACATCA 
GGCTAGTGTG AATGCTATAG CTTGGGCGCC TCAGAGCTGT AAACATATTT 
GTTCTGGTGG TGATGATACA CAGGCTCTTA TTTGGGAGCT TCCTACTGTT 
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1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 



GCTGGACCCA ATGGGATTGA TCCGATGTCG GTTTATTCGG CTGGTTCGGA 
GATTAATCAG TTGCAGTGGT CTTCTTC6CA GCCTGATTGG ATTGGTATTG 
CTTTTGCTAA CAAAATGCAG CTCCTTAGAG TTTGAGGTGC AGATGTGAAG 
T6ATCAATAA GGATTTTAGC ATAGACCCGT ATAATCGTCA TGTGCGTAAG 
TAGGTTTGGT TTGCGCTCCC TCTCGCTTTT AGGTCCGCAA T6ACTCTGTA 
TCTATCTGAT TGTAACTAAA ACTGAATTCA TTTGAT6AAC CAAATGATAC 
TATTATCTTA TGTTGTGTAT AAAACCCAAC CAGGATATAT TGCGGTTTCT 
GGT6TTTAGA TTTGGTAATT GGAGCTTAGT ACAATGCAAC CCTGTCTTGC 
TTTATTGGAC GTCTCTAA6A TAAATCA6C 
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